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VIDFO qnpiM^ 

This invention relates to the encoding of video signals, in particular for 
transmission over low bandwidth transmission systems e.g. those that require less 
5 than 128kbit/s. 

Around 6MHz of analogue bandwidth or a data rate in excess of 
100Mbit/s for digita. information is required to transmit broadcast qua.ity television 
s,gna.s uncompressed. Such high bit rate signals are impracticab.e to transmit by 
limited bandwidth systems. Therefore it is necessary to reduce the amount of 
10 .nformation by compressing the video signal. This can be done by taking 
advantage of the correlation between neighbouring elements (pixels) of a picture 
and compromising between the reduction in data and the quality of the picture. 

Video compression techniques rely on the correlation between 
neighbouring pixels, either in space and/or in time. For instance, in an area of a 
1 5 scene which is relatively uniform (for instance a wall of a room), the similarity of a 
pixel with neighbouring pixels within the area is likely to be fairly close. Similarly, 
in a fairly static scene, the pixels of one frame will correspond closely with the 
equivalent pixels of a subsequent frame. Hence pixels of a single frame can be 
coded with the respect of their relationship to each other (intra-frame coding) 
20 and/or with respect to their relationship with pixels of neighbouring frames (inter- 
frame coding). Intra-frame coded frames (intra-frames) can clearly be decoded 
without reference to any other frame whilst inter-frame coded frames (inter- 
frames) require information in relation to the frames used in the prediction. 
Differential techniques may also be used to compress video signals further. 
25 Differential coding techniques code only the differences between frames so 
meaning that less data is required than that required to encode the whole frame. 

Various video compression standards have developed. One such is the 
International standard ISO-IEC11172 "Coding of Moving Pictures and Audio for 
Digital Storage Media at up to About 1.5Mbit/s", known as MPEG 1 . This standard 
30 was developed for the storage of video and associated audio on digital storage 
media such as CD-ROM, digital audio tape (DAT), tape drives, writable optical 
drives or for transmission over telecommunications channels. According to the 
MPEG1 standard intra-frames are included in the data stream at regular intervals to 



WO 98/26604 

PCT/GB97/03296 

2 

a..ow random access to the video signa, and editing thereof. The regu.ar intra 
frames a,so a,,ow interaction with the video signa. i.e. a user can re": 

5 r ■ frames ° f the — *- » - - «. norm p : : v 
t n i , r ever this increases the avera9a ■* - p - - ™ o 

th-orma, p.ay m0 de of the video signa, which inCudes bot h intra- and inter 
Standards 

y ~ iwws, uaia lclte systems have also evolved For 

ISDN network) .nd 1TU „.„d.,d H . 263 fo , co<) , ng „„„ " 

I... *-, «*«. „„ con^nicnon over . PSTN ^ ^ J « 

.ch.™, do no, ,000,0 ,h. r. 9 u la , ,„o,o si on „, in,,..,™., 0urlng ,„„ ^ j 

15 d^oTdl ''J"" 3 -'™ 08 "! **-«' «— *- » -or condttons or. 
1 5 de.ec.ed by ft. reaver „ p„ ssib ,y on 5 cen. e„.„ g . s . WM„ ,„,. „„.„, that ,„ e 

overage dot, ro,. ie roduood comp.red ,„ MPEG1 <,„. do., produced fo, ,„„,. 
frames ,cp,. s . mino o sig „M„n, B ,op ort „„ of , h „ tot „ „ „ 

p,ov,d. ,n,e„c«v. ,.c„„i. s s „o„ „ st (orward „ reverse s|nce ^ 

aocos, p„„„ ,„ d .„ odro, ,„,„ „ „. start . ^ ^ amJ ^ 

20 " n °' SU " 3bte <™*»"» video „,.,„, * wnioh J 

interact. 

«n the artfde entitled "Robust H.263 Compatib.e Transmission for Mobi.e 
Vdeo Server Access" by N Farber, E Steinbach and B Griod presented at the IEEE 

25 L F ' rSt ' ntemati ° na ' W ° rkShOP °" -age/Video Communications held at 

25 Loughborough University in the UK on September 4-5 1996 a method of ai.owing 
random access to an H.263 encoded video stream is described which does no! 
regu-re the overhead of periodic intra-frames. According to this article, two data 
streams are stored, one ,a so-caHed P-stream, consisting of inter-frames which in 
norma, play mode, is transmitted. The second data stream , a so-cal.ed .-stream, 
30 cons.sts of mtra-frames. This is primary used for error recovery but it is noted 
that random access of the video signa.s may be provided on reguest bv the user by 
■nsert.ng .ntra-frames into the transmitted signa,. Thus in norma, use an inter- 
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ra ma eoded s , 3 „a, varthou, 0 „ iodica , ^ ^ 

be,n g trotted when tandom acceaa to the video atream „ r . qulred 

A, acknow.ed 9 ed by the authora. the insMon 
_e a ,„„, whlch re5ujts ^ ~ 

5 ™. ,eoo„ *o m ,He ao.ca„ed P- stTOm and the (rama £ 
ft. oo„..po„d,„ g ,„, ra ., rarna . The author , TO , hat lo raduc . m . J 
mr,. ,,™ ,a eneoded ,r. m the ,a=„„ aWctad , rama „ th . p . stream 
from the original video signal. 

According to the invention there is provided a method of encoding a video 
1 0 s.gnal represent.ng a moving picture, the method comprising: 
receiving successive frames of a video signal; 

of th r in9 frame ° f Vid6 ° S, ' 9na ' " ferenCe t0 a P~*» ^me 
of the v.deo s.gna,, so generating a first set of data which, when reconstructed 
represents a first playback mode of the moving picture; 
15 coding frames of the video signal, so generating a second set of data 

wh.ch, when reconstructed, represents a second p.ayback mode of the moving 
picture; uv» ig 

decoding the first set of data to produce a first reconstructed video signal- 
decoding the second set of data to produce a second reconstructed video 

4\j signal; 

ca.cu.ating the differences between a frame of the second reconstructed 
v-deo Sign a, and a frame of the first reconstructed video signal corresponding to 
the fram e succeeding the frame of the second video signa. and, when the 

25 sToTdata 3 ^ ' thr8Sh0 ' d ' *" differenC6S *" *° a 

Thus, a frame reconstructed from the second and further set of data is a 
close match to a .inked frame of the P-stream. without a significant increase in the 
bit rate per frame for an intra-frame being required. 

30 h < Preferab ' V SeC ° nd ° f d3ta " 96nerated WiTh ° Ut ref — to any 
30 other frame of the video signa. i.e. it is intra-frame codec. These ,-frames may ' 

then be used to provide a fast forward and fast reverse facility 

The second set of data may be generated using a coarser quantisation 
step s,ze than that used to generate the first set of data and the further set of data 
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may be generated using a quantisation step intermediate that used to generate the 
,st set of data and that used to generate the second set of data. Thus" 
rarne coded picture can he encoded without exceeding the maximum " 

. ~r:rr et of data im ~ *-* - - ~~ ~ 

Preferably the further set of data i« , . . 

e „„ M ls combin ed with the frame of the 

^jr"^:^ ~" — ^uch frame and the 
— « c«cu,««J. and, when the differences exceed the 

hr shoM. the differences are coded, so generating a yet further set of da^a 
10 Preferab.y these combining, ca.cu.ating and coding steps are carded out iteratt y 

fxed number of times e.g. twice for each frame of the second reconstructed vTdeo 
S .gna. of until the differences are less that the threshold 

The invention wi,. now be described further by way of exampie on.y with 
1 5 reference to the accompanying drawings in which: 

Figure 1 shows an interactive visua. services system inc.uding encoded 
v.deo s.gnals encoded according to the invention; 

Figure 2 shows a b.ocx diagram of a' video encoder according to the 

fast f R T h d3ta 9enerated 3CC0rdin9 tQ inVention ' ^ -ab.es 

fast forward and fast reverse; 

Figure 4 5 ho„ s dTO ae ™r« BrJ accordin9 „ another embo(fim . m 
— „H,cK ,„ow s „„«, „„ fomard aod ^ 
pause availability; H 1 

25 Figure 5 shows how control lists are used in the play mode- 

^ Figure 6 is an example of elements of the controi lists shown in Figure 5; 

Figure 7 is an example of how the contro. lists are used to generate a 
bitstream by a video server. generate a 

30 Figure 1 shows an interactive video system such as a so-ca„ed video-on- 

demand system which provides bitstreams representing a video seiected by a user 
on demand by the user. The system comprises a video server 2 which co-operates 
-h a store 4 of encoded video signals representing moving pictures ^ 



invention; 

20 
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movies, documentaries etc. A p.uraHtv of user terminals 6 are connected to the 
video server 2 via transmission Iinks 7 such as the PSTN, ISDN . radio sat it^ 
« user ,s provided with means ,not shown, with which to centre, the video 

5 stanals Th " *" * » ^ 

t7 t n sm ' S t r rn bB ^ — — medium as that used 

to transm,t the s.gna.s from the server 2 to the termina. 6 or a separate 

transmission medium. separate 

sem „ " 3 """""<> P**™ » b. P.a»=d and , co„„„ s i 9 „„ is 

10 « g „, , d .„, i(iM the , ocatfa „ „ th , v , deo signa , ^ we s(ore ^ =» 

r ?° t 4 ,o ,he ,eques,in9 user , " minai 6 - — - - — «- 

ir 7! ,ermlna '' SerVerS '° orovld ' ,h,s «» « m -he 

f„.d o, v,d.o.on.de™„d and „i„ „„, 6e dlscusS(!d ^ ^ 
invention has particular effect on the server. 

15 The video signa.s in the store 4 may be encoded according to any 

appropriate coding scheme. 

Within the many current coding schemes, such as the H.261/263 and 
MPEG standards, there is a process known as quantisation to reduce the number 
of b,ts transmitted. The picture is transformed from the spatia. domain into the 
frequency domain using a Discrete Cosine Transformation (OCT,. The resu.ting 
data is scanned in an order of increasing frequency components after which it is 

thThT " ^ ' fiX6d nUmber '' ^ th ° Se ffiSU,tS ° f ^ d ' ViSi - »- ^an a 

th eshoid are discarded. The higher frequency components are usuaHy of iower 

25 r e ,ower frequencv componems s ° that ' °- ^ » * ««- 

many of the high frequency components are discarded. The number used as the 

nlT , T QUamiSer ^ ChOS6n ^ SnCOder t0 — that * -sonabie 
number of frames are transmitted to the channei. For examp.e, when there is a 

great dea. of detaii or movement in the video sequence to be encoded the encoder 

30 macrob,oc k used to encode the signa,. The use of a coarse quantiser has other 
unwanted effects; in the e.ements of the picture that are transmitted the decoded 
coefficient values have large steDS result.™ \ n • 

9 PS ' resu,t,n 9 an image which has a low signal- 
to-no.se ratio and appears 'blocky'. 
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comprises an input 1 1 an <.n^' 'nvent.on. The encoder 

input 11. an encod.ng means 12 in the form of a DCT p,i.,,.i..- 
unit 14 which outputs DCT coefficients and a quantiser 16 l C h 
■ output of the OCT unit 1 4 . The output of J JZJ'.TjZT ^ 
vanao.e ,en g th coder ( VLC, 18 and a.so to an inverse quanJsT 2 The 
quant.sation step size of the quantiser 16 and ,h. ■ 

controltoH hw * , 6 ' nVerse ^ uan tiser 21 is 

controlled by a control unit 22. The VLr ia 

• ne vlc 18 is connected to the control unit 22 

VLC The con,,,, on,, modifies ,h. „u,n t is,,ion 5tep si „ „, „„ qu , ntis<>r 
he ,nvers, qU a„,, S e, 2 , to keep , he ^ ^ - 

p.cnrre ta, allowed by th . cnosen coding m P" 

For thi. reason, no, every frame o, video „„„„ „ „„ c 
eceasanyencoded. Th. comyo, „„„ 22 mey eaose „,r„ea ,o be dropped h 
15 amoun, of daf, required ,o encode e frame beoomaa ,00 9 re„ 

25 ,„ ^ r e ' Se qU ' n " Ser 2 ' *"""" ° CT U " K 24 <°™ ■ '<>«' *=od« 

25. ,„ coding ,„e pioure d.ooded by fhis focaf decoder ,a ,npu, ,0 a 

ram s.ore , F S, 25. WHen ,na „.„ frame t0 be en „ d . d „ ' " ' » 

encode, e, ,np„, ,0. , S ob,r,cor 28 cafcu,,,., ,H. di „ erences oetvv „ en ^ 

may „a. be cried „ by a convemiona, mo „o„ e s „ma,=r 3, and. „ M ,h, 
contents of the frame store 26 are m „„T ^ • 

vectors Such m •• " aCCOrdance the motion 

vecto^ Such mot.on est.mation is we,, known in the field of video coding and w«, 
not be discussed further herein. 

" addino » LT""' C0<,in3 ' ** C ° n,emS °' S <°" 26 « 1— by 

.dd,n 8 he ddferencea ceicoia.ed by ,h. sub ,rac,.r 28 ,o ,he frame a,ore 26 .. 

According ,o a firs, embodim.n, of ,he inven.ion fwhich wii, be described 

30 ,7 " T* " 31 e " COdin9 °' " *- -«*- severe, pas 

30 through the encoder: passes 

first sJof /, t P3SS t0 96nerate ' aCC ° rdin9 f ° reqU,Ved COdi "9 -"erne, a 
frst set of data 10 corresponding to a norma, p, ayback mode ^ 

instance ,n H.261 or H.263 coded video this hit stream wi„ consist of anlniti^ 
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frame followed by P frames. This first set of data 10 is stored in the local store 
34. 

2. A second pass to generate a second set of data 20 representing a 
second playback mode of the video signal e.g. fast forward or fast reverse. This 
5 pass generates an I frame sequence ITIT at a lower bit rate suitable for fast 
forward and reverse, together with entries in control lists of a control structure 
defining links to and from the first bitstream. The control unit 22 switches out the 
local decoder 25 from the encoding ioop. The control unit is programmed to 
generate this second set of data at a bit rate that is compatible with the required 
10 fast forward/ fast reverse playback mode. For example, if a fast forward/fast 
reverse playback mode at 10 times the normal speed is required, the second set of 
data is generated at 1/10 times the average bit rate of the normal playback mode. 
The second set of data 20 is also stored in the local store 34 with the associated 
control lists. 

1 5 3 - Subsequent passes to generate additional P' P" and P'" frames 30. 

An l-frame of the second set of data 20 is used as the starting point for the P' 
P"P"' sequence and a decoded frame I' of the second set of data 20 is loaded into 
the frame store 26 by the control unit 22. The P frame in the encoded normal 
bitstream 10 (i.e. the one produced in pass #1) at the point at which it is to be re- 
20 joined is decoded and applied to the input 11 of the encoder as P ln) . The 
subtractor 28 calculates the differences between the picture I' in the frame store 
26 and the reconstructed target P, nl frame of the normal playback bitstream. If the 
differences between the two are less that a predetermined threshold then the 
encode process will be stopped. If greater than the threshold, the differences are 
25 encoded as P' and stored in the local store 34 as a third set of data. P' is then 
decoded and added to the frame store 26 by the adder 32 and the control unit 
again initiates the calculation, encoding and storage of the difference between the 
contents of the frame store and the frame P lnl . This continues until the input to 
the encoder P ln , and the contents of the frame store 26 converge such that the 
30 differences between the two are less than the predetermined threshold. This data 
P',P", P'" is stored as third set of data. Thus a variable number of additional P' 
frames may be generated for each frame of the second playback mode. 
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In a simpler form of the invention the encrw 
—or o, framM . „ xed „ umb ; « ™ " « . «~0 

~- - - - — ■ - - « - - :;r e * De, * n " 

further set of data 30 P' P" p«" a n„. • and a 

♦~ ba . ai< .. j ^ JOP ' P < P a,,ow,n 9 ^ normal playback mode bj 

The intraframes for the fast forward/ fast reverse payback mode 

:r^;r::::i:;r;:r r; - — ~ - 

15 frame. The control unit 22 of the encoder has the freedom for u 

r - r , . tt ^ , ha guamls „ ::r;:rr;~ r 

after each If '•■P«*rv*y. The number of P frames encoded 

after each I frame ,s programmab.e at the time the materia, is encoded a „n ■ 
20 fundamental to the invention. * Bnd ' 3 " 0t 

^ 6aCh '* fra ™ * -coded with a re,ative,y coarse quantiser step size 
(.... .eve. 8 ,n the H.261 coding scheme, to ensure that the number of bits t 
re ; tr_ d to form the r frame fa„ within the maximum number of bits c 
limit of the codina schemo tk.v u * • "'wpicxure 

ana h. 263 specifications only allow for in™, 
pixel values. Thus to converge to the frame of th. m • „• 

r 6 ma,n b,t stream with which the 

I frame is indirectly) linked the r fr=,«,« • 
, ft ^. . , „ ' frame In P ra «ice is followed by 2 or more 

30 ,d dlt ,.„ a P ^ P. and p., p.., t0 sha , pan , he ° ™ 

co , h8 degrae of sharp „. ss obtalned by « * - 

<r™, Th. flm frama P . h3s . quant , ser 5tap sl;e J"'' 
P~ vo, Ues „ ,„ tha ^ (a ., |ave , 8 ,„ co , " s 2: a ' 
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the second frame P» has a finer quantiser step size (e.g. ,eve. 4 in the H 261 
codmg scheme) to achieve a picture quality comparable to that of the main 
bitstream. The lower range of pixel values is acceptable at this time as it is 
expected that the magnitude of any differences between the P' and P" frames is 
5 less than 255. A subsequent P< frame P'» uses an even finer quantisation step size 
(e.g. level 1 in the H.261 coding scheme). 

Among the advantages of this technique are that, when fast forward or 

fast reverse is requested, there is a lower delay until rh„ w , , . 

by the termina.. Successive I' frames continue to be transmitted until a control 
10 signal requesting the server 2 to resume the norma, p.ayback mode is received 
from the user terminal 6. The additional P' frames associated with the final r 
frame played are then transmitted to converge the signal to that represented by 
the next frame to be played from the normal bitstream. This technique can be 
extended to include any number of additional P frames after the |- frame with 
15 increasingly fine quantiser step size to converge to the optimum picture quality 
supported by the coding scheme. 

The final additional frame P" should have few differences from the 
normally encoded frame of the data 10 that it is linked to. as any differences will 
be perpetuated through the normal sequence of P frames that are transmitted for a 
20 considerable time. They will only be removed for a particular macroblock by the 
'systematic update' process specified as a part of the H.261 and H.263 standards. 

Thus, the coarsely quantised I' frames require fewer bits for transmission 
than standard l-frames of the MPEG or H.261 standards and therefore are 
transmitted with less delay. The subsequent additional P' frames remove the 
25 artifacts that result from the coarse quantisation and encode the large expected 
differences between the coarsely quantised I frame and the P< frame. The final P'" 
frame is quantised such that it is a good match with the normal bitstream. 

This enables fast forward and fast reverse without introducing additional 
artifacts into the signal to be displayed which would then be perpetuated into the 
30 norma, playback mode of the video signal. The speed up obtained in fast forward 
and fast reverse is defined at the time the sequence is encoded. If a high speed 
fast forward and fast reverse is required then there will be many P frames encoded 
between each I frame and there will be a noticeable amount of time between each 
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I frame. For example, if the stored material is encoded at a mean bit rate of 
128kbit/s and a speed up factor of 10 times is required in fast forward and 
reverse, then the I' frames will be encoded at a mean bit rate of 1 2.8kbit/s . 

When a pause is requested the user's terminal could display the currently 
5 decoded frame. However when the decoded frame is maintained on the user's 
display, any artifacts in the picture will be more obvious to a viewer. 

Alternatively therefore, the server 2 may continue to transmit the normal 
playback signal until the next link to an I' frame of the second set of data is 
reached. The I' frame and the subsequent P', P" and P'" frames are then 
10 transmitted for display. However if the fast forward has been encoded at a rate 
that is much faster than the normal playback rate of the video signal than there 
may be a significant delay before the next I' frame is reached. 

According to a second embodiment of the invention, a plurality of pause 
sequences PP'.PP-.PPP'" are encoded, relatively more frequently than the I frames 
15 for fast forward and fast reverse, as can be seen from Figure 4. Each PP' pause 
sequence 40a is encoded by calculating the differences between a reconstructed 
frame P„ of the normal playback mode and the corresponding frame F„ frame of the 
original video signal. The pause sequences 40 are also stored in the store 34 as a 
further set of data. 

20 The encoding process for pause sequences is similar to that used for the 

fast forward and fast reverse sequences except that, in this case, no initial I frame 
is generated: 

1. The first set of data 10 corresponding to a normal playback mode 
bitstream is decoded and frames loaded into the frame store 26 of the encoder 

25 from the local store 34. 

2. A further set of data 40 is generated representing a second playback 
mode i.e. pause. This further set of data is generated by inputting an original 
frame F, n , of the video signal into the encoder input 1 1 and loading the frame store 
26 with the corresponding decoded frame of the main bit stream 10. The 

30 differences between the two frames are calculated and, if greater than a threshold, 
encoded and stored as pause frames PP' in the local store 34 with associated 
control lists. 
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3. PP' is then decoded and added to the frame store 26 by the adder 
32 and the control unit again initiates the calculation of the difference between the 
contents of the frame store and the frame F in) . This continues until the input to 
the encoder F< nJ and the contents of the frame store 26 converge such that the 
5 differences between the two are less than the predetermined threshold. Thus a 
variable number of PP' pause frames may be generated for each frame of the 
pause mode. 

Clearly the final PP' frame generated has diverged from the corresponding 
frame in the normal bitstream 10. This may be dealt with in a number of ways: 

10 a) an entry in the control list associated with the final PP' of a particular 
pause point is linked to the next I' frame to occur in a fast forward data set e.g. in 
Figure 4 PP' is linked to I'<| ( as indicated by numeral 42 in figure 4); or 
b) further data is encoded to cause the picture represented by the pause 
frames PP' to converge to a reconstructed P frame in the normal playback 

15 bitstream 10. An entry in the link associated with the final PP' frame is thus linked 
to the next P frame, as indicated by numeral 44 in Figure 4. In the same way as 
described with reference to the first embodiment, the decoded frame of the main 
bitstream 10 with which the pause point is to be linked {e.g. the next frame P x + 1 ) 
is input to the encoder via input 1 1 whilst the decoded final pause frame PP'" is 

20 loaded into the frame store 26. As for the first embodiment, the differences 
between the input frame P x + 1 and the contents of the frame store 26 are encoded 
to form play resume frames and associated control lists until the two inputs to the 
subtracter converge or until a fixed number of iterations have occurred. 

Because interframes contain less bits than intraframes, the use of PP' 

25 frames alone for pause mode mean that more pause points may be inserted 
without a large increase in the volume of the video data stored on the server. 

Audio-visual material from a variety of sources may be encoded, the 
encoded bitstream being stored on a high capacity storage system, typically a hard 
disc drive or CD-ROM for storage at a later stage in a store 4 of a video server 2. 

30 Thus, in the invention, the encoding process occurs only at the time the 

material is processed and stored on the storage medium. The material may be 
stored at several rates that are required to support the transmission bandwidth and 
the terminal types that are to be supported. Typically the audio-visual material will 
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be transmitted over PSTN. ,SDN, LAN or ADSL networks and the encoding process 
is earned out for each required rate. 

The stored materia! may be accessed by any client terminal 6 that is 
capab.e of decoding the encoded bitstreams plus the associated audio coding 
5 algorithms. A contro. channe. 8 is required to enable control of the server to 
enab.e the fast forward, fast reverse, pause and other modes. Currently there is 
no ITU standard for this contro. mechanism. Figure 1 shows such an arrangement 
At the time the audio-visual materia! is requested by a client terminal, the 
material may be streamed directly to the standards comp.iant terminal, with no 
10 additional decoding/encoding/re-coding processing required by the server. Thus a 
comparatively inexpensive server can support many simultaneous accesses to the 
stored material. 

Commands from the client decoder are used to control the form of the bit 
stream that the server transmits. The constituents of the transmitted bitstream 
15 are control.ed by contro. lists he.d on the server. In Figure 5 FO to F9 represent 
the original frames in the material to be encoded and occur at a fixed frame rate 
Pn represent the normal playback mode predicted frames that will be re- 
constructed by a decoder and B(n. m) represents the bitstream that, when 
transmitted, enables the decoder to move from P„ to P 
20 The control lists are generated during encoding and held on a storage 

medium for access by a server from the store 4. The contro. lists 50 contain links 
from the different frames he.d in the store. There is one contro. list for each 
element of bitstream. labelled B(0,1) to B(N,N + 1). In Figure 5 the video stream is 
shown following the 'play' link from one control list to the next. 
25 An example of the elements in a control list is shown in Fig 6 Each 

contro. list or block 50 includes: a bit stream identifier 51; a time stamp 52 which 
.nd.cates the earliest time with respect to the start of the encoded signal at which 
a bit stream can be played out by a decoder; a file offset 53 which indicates the 
portion of the frame's compressed bitstream in the store; a bit stream size 54 
30 which indicates the number of bits in the frame; a play link 55 which indicates the 
.dentification of the next control list to be p.ayed in play mode; a p.ay backward 
link 56 which indicates the identification of the next control list to be played in 
reverse p.ay mode; a fast forward link 57 which indicates the identification of the 



WO 98/26604 

PCT/GB97/0329fi 

13 

next centre, .ist to be played in fast forward mode; a fast reverse « 58 which 
•n .cates the .dentification of the next contro, ,ist to be played in fast rev 

::t::::: se ,ink 60 which indicates the iden ~ * - - — , : 

to be , ayed In pause mode; ^ fl regtaft ^ jndjcates k 

5 .dent,f,cat,on of the next control .ist to be played when the o.av mnrf • 

H ycu wnen tne play mode is resumed. 
F.gure 7 shows the contro, lists 50 for a portion of an encoded video 
s.gna, Contro, , ists 501-506 represent the P frames of the norma, playback 1 
Contro. „sts 507-509 are associated with th„ r , . 

forward/fast rewind. Contro, lists 510 5! 8 ' 
m #ra 510-518 are assoctated with the additional P' 

normal playback bitstream. 

Figure 7 does not show the pause sequences PP' however it wi„ be 
appreaated that similar contro, Hsts are stored for this, and other, p(a y a 
mode(s) also as required. 

15 
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1 

5 



A method of encoding a video signal representing a moving picture, the 
method comprising: 

receiving successive frames of a video signal; 
coding each frame of the video signa. with reference to a preceding frame 
of the video signal, so generating a first set of data which, when reconstructed 
represents a first playback mode of the moving picture; 

coding each frame of the video signa., so generating a second set of data 
10 which, when reconstructed, represents a second p.ayback mode of the moving 
picture; 

decoding the first set of data to produce a first reconstructed video signal- 
decoding the second set of data to produce a second reconstructed video 

signal; 

1 5 ca«cu.ating the differences between a frame of the second reconstructed 

video signa. and a frame of the first reconstructed video signal corresponding to 
the frame succeeding the frame of the second video signal and, when the 
differences exceed a threshold, coding the differences and so generating a further 
set of data. 

20 

2. A method according to claim 1 wherein the second set of data is 
generated without reference to any other frame of the video signal. 

3. A method according to claim 1 or 2, wherein the second set of data is 
25 generated using a coarser quantisation step size that that used to generate the first 

set of data and the further set of data is generated using a quantisation step 
mtermediate that used to generate the first set of data and that used to generate 
the second set of data. 

30 4. A method according claim 1, 2 or 3, further comprising combining the 
further set of data with the frame of the second reconstructed video signa. 
ca.cu.ating the differences between such frame and the frame of the first video 
s.gna.. and, when the differences exceed the threshold, coding the differences and 
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so generating a yet further set of data, said combining, casting and coding step 
being earned out iteratively until a criterion is met. 

5. A method according to claim 4, wherein the criterion is that the steps are 
5 carried out twice for each frame of the second reconstructed video signal. 

6 A method according to Cairn 4, wherein the criterion is that the 

differences are less than the threshold, 

10 7. A method according to claim 1 wherein the second set of data represents 
a pause mode of the video signa., the pause mode being generated by iteratively 
encod,ng the differences between a decoded frame of the first set of data and the 
corresponding frame of the original video signal. 

15 8. Video signal encoding apparatus comprising: 

an input for receiving successive frames of a video signal 

encoding means and a quantiser for quantising the output of the encoding 

means; 

an inverse quantiser for inverse quantising the output of the quantiser and 
20 decoding means for decoding the output of the inverse quantiser; 

a subtracter for calculating the differences between a frame stored in a 
frame store and a frame input to the encoding apparatus, the encoding means 
bang arranged to encode the differences, when the differences exceed a 
threshold, and so generate a further set of data; 

control means arranged to control the inputting of a decoded inter-frame 
video signal into the encoding apparatus and the inputting of every frame of the 
video signal into the frame store, such that the frame of the video signal input to 
the encoder corresponds to a frame of the video signa. succeeding the frame 
stored in the frame store and to control the quantiser and the inverse quantiser so 
30 as to modify the associated quantisation step size. 

9. Apparatus according to claim 8 wherein the control means is arranged to 
modify the quantisation step size such that the video signa. input to the frame 
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store is generated using a coarser <uantisat.cn step size that that used to generate 
the ,nput to the encoder and the further set of data is generated usinT 
guant.sat.on step mtermediate the two previous.y used quantisation step size 
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Apparatus according to c.aim 8 or 9 wherein the video signai input to the 
frame store ,s generated without reference to any other frame of the video signa.. 

11. Apparatus according claim 8, 9 or ,0, the apparatus bain- * . 

combine the further set of data with the frame stored in the frame store Z7orJ a 
10 comb,ned frame, to ca.cu.ate the differences between such combined frame and 
the said frame of the video signa, input to the encoder, and, when the differences 
exceed the thresho.d, to encode the differences and so generate a yet further set 
of data, the contro. means being arranged to cause said combining, ca.cu.ating and 
cod.ng step to be carried out iteratively until a criterion is met 

15 

12. Apparatus according to c.aim 11. wherein the criterion is that the steps 
are carried out twice for each frame of the second reconstructed video signal. 

13 A method according to c.aim 11, wherein the criterion is that the 
ZO differences are less than the thresho.d. 

1 4. Interactive video apparatus including apparatus as c.aimed in any of Cairns 
8 to 13. 

25 15. A video signal representing a moving picture, signal comprising- 

a first set of data which, when reconstructed, represents a first p.ayback 
mode of the moving picture; 

a second set of data which, when reconstructed, represents a second 
playback mode of the moving picture; and 

a third set of data which represents the differences between a 
reconstructed frame of the second set of data and a reconstructed frame of the 
frst set of data corresponding to the frame succeeding the reconstructed frame of 
the second set of data. 
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